Building a Large Grammar for Italian
نویسندگان
چکیده
We describe the construction of a large lexicalized tree adjoining grammar for Italian, automatically extracted from an annotated corpus. We first introduce the TUT, a dependency style treebank for Italian, then we illustrate the algorithm that we have designed to extract the grammar, and finally we report two experiments about parsing complexity and coverage of the extracted grammar.
منابع مشابه
Building a Generator for Italian Sign Language
This paper presents an ongoing work about the implementation of a CCG grammar for Italian Sign Language. This grammar is part of a generation system used for Italian-LIS translation.
متن کاملBuilding a Wide Coverage Dynamic Grammar
Incremental processing is relevant for language modeling, speech recognition and language generation. In this paper we devise a dynamic version of Tree Adjoining Grammar (DVTAG) that encodes a strong notion of incrementality directly into the operations of the formal system. After discussing the basic features of DVTAG, we address the issue of building of a wide coverage grammar and present nov...
متن کاملA Dependency-based Algorithm for Grammar Conversion
In this paper we present a model to transfor a grammatical formalism in another. The model is applicable only on restrictive conditions. However, it is fairly useful for many purposes: parsing evaluation, researching methods for truly combining different parsing outputs to reach better parsing performances, and building larger syntactically annotated corpora for data-driven approaches. The mode...
متن کاملLost in Grammar Translation Lost in Grammar Translation
1http://www.di.unito.it/∼tutreeb/ Italian Treebank2 (VIT), and the ISST3. None of them is comparable in size with the English Penn Treebank. This limits the possibility to have reliable induced grammars for Italian. Initial studies have shown that probabilistic grammars induced on a small corpus have not impressive performances [5]. Building larger corpora is then needed. We have been working o...
متن کاملA lexical analysis of Italian clitics
In this paper, I will propose a lexicalist analysis of Italian cliticization, which is based on the assumption that Italian clitics exhibit affix behavior. I will show that this analysis can deal both with the syntactic properties of cliticization and with their morphophonological properties. In particular, I will suggest that Italian clitics merge together into a morphological unit which combi...
متن کامل